Search CORE

2,337 research outputs found

Constrained speaker linking

Author: Brümmer Niko
van Leeuwen David A.
Publication venue
Publication date: 01/01/2014
Field of study

In this paper we study speaker linking (a.k.a.\ partitioning) given constraints of the distribution of speaker identities over speech recordings. Specifically, we show that the intractable partitioning problem becomes tractable when the constraints pre-partition the data in smaller cliques with non-overlapping speakers. The surprisingly common case where speakers in telephone conversations are known, but the assignment of channels to identities is unspecified, is treated in a Bayesian way. We show that for the Dutch CGN database, where this channel assignment task is at hand, a lightweight speaker recognition system can quite effectively solve the channel assignment problem, with 93% of the cliques solved. We further show that the posterior distribution over channel assignment configurations is well calibrated.Comment: Submitted to Interspeech 2014, some typos fixe

arXiv.org e-Print Archive

Radboud Repository

Creating a Dutch testbed to evaluate the retrieval from textual databases

Author: Hiemstra Djoerd
Leeuwen David A. van
Publication venue
Publication date: 01/01/2002
Field of study

This paper describes the first large-scale evaluation of information retrieval systems using Dutch documents and queries. We describe in detail the characteristics of the Dutch test data, which is part of the official CLEF multilingual texttual database, and give an overview of the experimental results of companies and research institutions that participated in the first official Dutch CLEF experiments. Judging from these experiments, the handling of language-specific issues of Dutch, like for instance simple morphology and compound nouns, significantly improves the performance of information retrieval systems in many cases. Careful examination of the test collection shows that it serves as a reliable tool for the evaluation of information retrieval systems in the future

CiteSeerX

University of Twente Research Information

A comparison of linear and non-linear calibrations for speaker recognition

Author: Brümmer Niko
Swart Albert
van Leeuwen David
Publication venue
Publication date: 01/01/2014
Field of study

In recent work on both generative and discriminative score to log-likelihood-ratio calibration, it was shown that linear transforms give good accuracy only for a limited range of operating points. Moreover, these methods required tailoring of the calibration training objective functions in order to target the desired region of best accuracy. Here, we generalize the linear recipes to non-linear ones. We experiment with a non-linear, non-parametric, discriminative PAV solution, as well as parametric, generative, maximum-likelihood solutions that use Gaussian, Student's T and normal-inverse-Gaussian score distributions. Experiments on NIST SRE'12 scores suggest that the non-linear methods provide wider ranges of optimal accuracy and can be trained without having to resort to objective function tailoring.Comment: accepted for Odyssey 2014: The Speaker and Language Recognition Worksho

arXiv.org e-Print Archive

Radboud Repository

A Fast Chi-squared Technique For Period Search of Irregularly Sampled Data

Author: Baluev
David M. Palmer
Galassi
Otero
Press
van Leeuwen
Publication venue: 'IOP Publishing'
Publication date: 13/01/2009
Field of study

A new, computationally- and statistically-efficient algorithm, the Fast

\chi^2

algorithm, can find a periodic signal with harmonic content in irregularly-sampled data with non-uniform errors. The algorithm calculates the minimized

\chi^2

as a function of frequency at the desired number of harmonics, using Fast Fourier Transforms to provide

O (N \log N)

performance. The code for a reference implementation is provided.Comment: Source code for the reference implementation is available at http://public.lanl.gov/palmer/fastchi.html . Accepted by ApJ. 24 pages, 4 figure

arXiv.org e-Print Archive

Crossref

Robust audio indexing for Dutch spoken-word collections

Author: Huijbregts Marijn
Jong Franciska de
Leeuwen David van
Ordelman Roeland
Publication venue: KNAW
Publication date: 01/01/2005
Field of study

Abstract—Whereas the growth of storage capacity is in accordance with widely acknowledged predictions, the possibilities to index and access the archives created is lagging behind. This is especially the case in the oral history domain and much of the rich content in these collections runs the risk to remain inaccessible for lack of robust search technologies. This paper addresses the history and development of robust audio indexing technology for searching Dutch spoken-word collections and compares Dutch audio indexing in the well-studied broadcast news domain with an oral-history case-study. It is concluded that despite significant advances in Dutch audio indexing technology and demonstrated applicability in several domains, further research is indispensable for successful automatic disclosure of spoken-word collections

University of Twente Research Information

Are small firms really sub-optimal?: compensating factor differentials in small Dutch manufacturing firms

Author: Bert Menkveld
David Audretsch
Gerco van Leeuwen
Roy Thurik
Publication venue
Publication date
Field of study

Onderzoek naar de vraag waarom er zoveel kleine bedrijven kunnen bestaan, ondanks de heersende opvatting dat dit bedrijven zijn die op sub-optimale schaal presteren. Kleine bedrijven blijken productiefactoren verschillend te belonen en toe te passen ten opzichte van grote bedrijven, en langs deze weg bedrijfsgroottenadelen te compenseren. Uit een steekproef onder 7.000 Nederlandse industriële bedrijven blijkt dat deze strategie van compenserende factordifferentialen toegepast wordt in Europa. Enerzijds lijkt deze strategie te leiden tot een nettoverlies voor de economie. Anderzijds suggereren de positieve relaties tussen bedrijfsleeftijd en werknemerscompensatie, en bedrijfsleeftijd en productiviteit dat er minstens een tendens is dat het inefficiënte bedrijf van vandaag het efficiënte bedrijf van de toekomst kan worden.

Research Papers in Economics

Speech-based recognition of self-reported and observed emotion in a dimensional space

Author: Jong Franciska M.G. de
Leeuwen David A. van
Truong Khiet P.
Publication venue: Elsevier
Publication date: 01/01/2012
Field of study

The differences between self-reported and observed emotion have only marginally been investigated in the context of speech-based automatic emotion recognition. We address this issue by comparing self-reported emotion ratings to observed emotion ratings and look at how differences between these two types of ratings affect the development and performance of automatic emotion recognizers developed with these ratings. A dimensional approach to emotion modeling is adopted: the ratings are based on continuous arousal and valence scales. We describe the TNO-Gaming Corpus that contains spontaneous vocal and facial expressions elicited via a multiplayer videogame and that includes emotion annotations obtained via self-report and observation by outside observers. Comparisons show that there are discrepancies between self-reported and observed emotion ratings which are also reflected in the performance of the emotion recognizers developed. Using Support Vector Regression in combination with acoustic and textual features, recognizers of arousal and valence are developed that can predict points in a 2-dimensional arousal-valence space. The results of these recognizers show that the self-reported emotion is much harder to recognize than the observed emotion, and that averaging ratings from multiple observers improves performance

Crossref

Radboud Repository

University of Twente Research Information

Creating a Dutch Information Retrieval Test Corpus

Author: Hiemstra Djoerd
van Leeuwen David
Publication venue: Rodopi
Publication date: 01/01/2002
Field of study

University of Twente Research Information